Supplementary Material for: Learning Structured Models with the AUC Loss and Its Generalizations
نویسندگان
چکیده
We have shown that with our enhanced representation, the ranking problem for given weights w reduces to the one in Joachims (2005) in the case of a fully-factored model. Here we show a similar result for the learning problem. Recall that our learning objective is defined as: min w λ 2 w 2 + 1 M m max z∈Z w ϕ(x m , z) + ∆ AU C (z, y m) − maxˆz∈Zˆz∼y m w ϕ(x m , ˆ z) (1) We would like to consider this objective in the case of a fully-factored model, where the single element scores are simply: w x i ≡ a i , and therefore: w ϕ(x, z) = i a i k z ki. Our goal is to show that the loss in this case is a simpler function of w. For a specific training example, we denote: z * = argmax z∈Z w ϕ(x m , z) + ∆ AU C (z, y m) (2) ˆ z * = argmaxˆz∈Z,ˆ z∼y m w ϕ(x m , ˆ z) (3) Due to the decomposition of the score and the AUC loss, z * is obtained by sorting a vector which consists of elements a i − c for i ∈ pos and a j for j ∈ neg, with c = 1/(|pos| · |neg|). On the other hand, ˆ z * is obtained by sorting the elements a i for all i ∈ pos, then sorting the elements a j for j ∈ neg, and then concatenating the results with pos before neg. W.l.g. assume that the elements are indexed according to the rankingˆz *. We next define indicator variables for all pairs i ∈ pos, j ∈ neg: y * ij =
منابع مشابه
Learning Structured Models with the AUC Loss and Its Generalizations
Many problems involve the prediction of multiple, possibly dependent labels. The structured output prediction framework builds predictors that take these dependencies into account and use them to improve accuracy. In many such tasks, performance is evaluated by the Area Under the ROC Curve (AUC). While a framework for optimizing the AUC loss for unstructured models exists, it does not naturally...
متن کاملSparse Structured Principal Component Analysis and Model Learning for Classification and Quality Detection of Rice Grains
In scientific and commercial fields associated with modern agriculture, the categorization of different rice types and determination of its quality is very important. Various image processing algorithms are applied in recent years to detect different agricultural products. The problem of rice classification and quality detection in this paper is presented based on model learning concepts includ...
متن کاملModelling potential impacts of climate change on the oak spatial distribution (Case study: Ilam and Lorestan provinces)
Examining the effects of climate change on the oak spatial distribution, as the main species of Zagros forests and its ecological and economic values is of significant importance. Here, we used species distribution models for simulating current climatic suitability of oak and its potential changes in 2050 and 2070. For this purpose, five regression-based and machine learning approaches, four cl...
متن کاملEVELOPMENT OF ANFIS-PSO, SVR-PSO, AND ANN-PSO HYBRID INTELLIGENT MODELS FOR PREDICTING THE COMPRESSIVE STRENGTH OF CONCRETE
Concrete is the second most consumed material after water and the most widely used construction material in the world. The compressive strength of concrete is one of its most important mechanical properties, which highly depends on its mix design. The present study uses the intelligent methods with instance-based learning ability to predict the compressive strength of concrete. To achieve this ...
متن کاملMachine Learning Models for Housing Prices Forecasting using Registration Data
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...
متن کامل